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NOMENCLATURE 

A(x) = associated function = Ax + Axj 

A n (x) = n-point associated function = Ax + Axj + . . . + Ax n . 

f(x) = function whose zeroes are to be t /und 

g(x) = Newton iteration function = x - f(x)/f'(x) 

xj = g(x), an alternate notation 

x p ,Xp* = primary conjugate points, x p < x p * 

x z = x-axis location of a zero of f(x) 

Axj - ith Ax = -f(Xj)/f'(Xj) 
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CONVERGENCE OF NEWTON'S METHOD FOR A SINGLE REAL EQUATION 

INTRODUCTION 


Newton’s method is a well known technique for finding the zeroes of a nonlinear equation. For 
simple functions which can be differentiated, it can be easily programmed on a programmable calculator. 
This ease of programming permits an approach which is termed “experimental mathematics” in this 
paper. Experimental mathematics refers to the use of a computer with a user friendly graphics package. 
Much can be learned about extremely complex functions which are intractable using normal analytical 
methods. Using intuition, and experimental mathematics, a convergence conjecture for Newton’s method 
was formulated. This conjecture is based on finding the zeroes of an associated function which is defined 
below. 


Newton’s method has the following convenient geometric interpretation. Starting with initial guess 
x, calculate the slope of f at x, draw the line through fix), with slope f'(x), and extend the line to the 
x-axis. Where the line hits the axis is the next iterate (Fig. 1). Now consider a specific example, fix) = 
cos(x). For this periodic function, one zero is as good as another, so attention is focused on the zero at 
x z = pi/2. Near this zero, Newton’s method clearly converges. As the initial guess moves away from the 

zero, convergence becomes less certain. Convergence of single point iteration functions is frequently 
investigated using the Contraction Mapping Theorem [ 1 ] . The theorem is stated as follows. 

Contraction Mapping Theorem [ 1 ] : Let g be a continuous iteration function mapping a real 

closed interval, I into itself. Assume a positive constant L < 1 exists such that |f(a) - f(b)| <L|a-b| 
for all a,b in I. Then in I there is a unique solution of the equation x = g(x) for any initial x in I. 



0 . 1 . 2 . 3 . 4 . 5 . 

Figure 1. Newton’s iteration. 





The Contraction Mapping Theorem provides sufficient, but not necessary conditions for con- 
vergence of the iteration function g(x). The conservative nature of the theorem will be demonstrated 
below. 


Consider again f(x) = cos(x). Let an initial guess near the zero x z = pi/2 move gradually away 
from the zero. For a t:,ne as the initial guess moves away from x z , Newton’s method converges. 
Intuitively, a point can be reached where the first iterate is on the opposite side of x z from the initial 

guess and the second iterate is back at the first point. Without computational errors, then Newton’s method 
would oscillate back and for f h between the two points forever. When two such points exist for a given func- 
tion, they will be called conjugate points and are said to be conjugate to each other. This terminology should 
not be confused with conjugates of the complex numbers. Intuitively, Newton’s method might be expected 
to converge inside the conjugate points. The purpose of this paper is to demonstrate that under certain con- 
ditions, Newton’s method will converge within these conjugate points. 

The concept of conjugate points can be made more concrete with some analysis. Newton’s 
iteration is given by the following equation. 


xj = x - f(x)/f'(x) 


( 1 ) 


The second iterate is given in terms of the first in the same fashion. 


X 2 = Xj - f(Xj)/f'(Xj) 


( 2 ) 


If x 2 = x is set, the following equation must be satisfied. 


A(x) = -f(x)/f'(x) - f(x j )/f '(x j ) = Ax + Axj = 0 


(3) 


A(x) is called the associated function in the remainder of this paper. By inspection, fixed points of g(x) 
are also solutions of A(x) = 0. For f(x) = cos(x), g(x) = x + cot(x), and A(x) = cot(x) + cot((x) + cot(x)). From 
the definition in equation (3), A(x) goes to ±°° if f'(x) or f'(xj) go to zero (as long as f does not go to 

zero at x or xj). If a function f has a zero at x z , and has a set of conjugate points, Xp and Xp*, such 
that Xp < x z < Xp*, with no other conjugate points falling in the interval (x p ,Xp*), then x p , and x p * 

are called primary conjugate points . For f(x) = cos(x), a set of primary conjugate points exist around 
each zero. Figure 2 depicts Newton’s iteration between the primary conjugate points around x z = pi/2. 

Intuitively, moving toward the origin from Xp other conjugate points should exist corresponding to 

points on different waves of the periodic function cos(x). This is indeed the case, and f(x) = cos(x) has 
a countably infinite number of conjugate points on (0,pi/2). No pair of these points is conjugate to each 
other but are conjugate to points on the other side of x = pi/2. The first three conjugate points of f(x) = 
cos(x) are depicted in Figure 3. 
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Further examples of functions and associated functions will be given in the next section. With 
these examples, the behavior of the associated function will be illustrated. The purpose of the remainder 
of this paper is to illustrate properties of A(x), and to prove that under certain conditions Newton’s 
method will converge inside primary conjugate points. 
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ASSOCIATED FUNCTION EXAMPLES 


In the previous section, the Contraction Mapping Theorem was given, and described as overly 
conservative. Consider now the case for f(x) = cos(x), with corresponding iteration function g(x) = x + 
cot(x). For the theorem to apply, g must satisfy a Lipschitz condition which may be restated as 
ig'(x) I < 1 on the convergence interval. For the fixed point at x z = pi/2, the corresponding interval is 

(pi/4, 3pi/4) which is approximately (0.7854, 2.3562). In actual fact the convergence interval is more 
like (0.4052, 2.736). Obviously, the Contraction Mapping Theorem is quite conservative. 

The actual convergence interval for f(x) = cos (x), is given by approximate values of the primary 
conjugate points associated with the zero at pi/2. In the previous section, three sets of conjugate points 
were given graphically. Table 1 gives numerical values to several decimal places. These conjugate points 
were calculated using an HP 1 1C programmable calculator, and they may not be accurate in the last 
one or two decimal places given. In fact, for f(x) = cos (x) the zeroes of the associated function were 
found using Newton’s method. For other more complicated functions, differentiation of A(x) is difficult, 
and solution by the secant method [2] was convenient. 

TABLE 1. SOME CONJUGATE POINTS FOR fix) = cos(x) 


0.405235142 

0.166506332 

0.108172203 

0.080435490 

0.064097883 

2.975086322 

3.033420450 

3.061157163 

3.077494771 


2.736357512 

6.116678975 

9.316605758 

12.48593512 

15.64386539 

-2.975086322 

-6.175013104 

-9.344342470 

-12.50227273 


While in principle, Newton’s method will iterate for all time between the pairs of conjugate 
points given above, in practice the iteration begins to wander away from the points because of roundoff 
and function evaluation errors. For the primary conjugate points, on the author’s HP 1 1C, the iteration 
wanders away only in the last decimal piace on the first iteration for the primary conjugate points. For 
succeeding iterations, the divergence continues. Intuitively, from the geometrical interpretation of 
Newton’s method, one might expect that the farther the distance between the two conjugate points, the 
faster the divergence. This is indeed the case. If one takes the values for the primary conjugate point Xp 

and adds 1 in the last decimal place to put an initial guess infinitesimally inside the convergence interval, 
the iteration will begin to converge. If on the other hand, 1 is subtracted from Xp, in the last decimal 
place, the iteration begins to diverge. 

Location of the conjugate points of cos(x) is a hit or miss proposition, (mostly miss) and the 
higher order conjugate points are hard to find because of the discontinuous nature of A(x). A(x) can be 
studied quite easily, however, with a set of good computer graphics routines. If fix) can be differentiated, 
A(x) can be plotted. The graphical exploration of the complex composite function A(x) falls in the 
realm of experimental mathematics and this section is devoted to this exploration for several examples. 



The periodic function cos(x) is generalized by adding a constant so that f(x) = cos(x) + c. Figures 
4 and 5 are plots of f and A for different values of c. For Xp = 0.405 . . . observe that A(x) crosses the 

x axis with positive slope. The slope of A(x) can be calculated in general. 

A'(x) = -1 + (f(x)f"(x)f(x 1 )f''(x 1 ))/(f'(x)f'(x,)) 2 (4) 


From equation (4), if T does not go to 0 at a zero of f, then at the zero the slope of A is -1. If A is 
continuous on (Xp,x p *), then it is positive on (x p ,x z ), and negative on (x z ,x p *). For every zero of 

f(x) = cos(x), this situation applies. As one looks at examples, this behavior will be observed frequently. 

Figures 4 and 5 show that as long as c lies between 0, and 1, A(x) varies rather smoothly with c. 
As soon as c goes to 1 and then to greater values, the character of A(x) changes abruptly and radically. 
Since F(x) = cos(x) + 1.001 has no zeroes, the sudden change is not unexpected. Observe that just 
because a function has no zeroes, the function may still have conjugate points as is the case for cos(x) + 
c where c is greater than 1. 

One of the purest examples possible of primary conjugate points and Newton’s method converg- 
ence is given by f(x) = arctan(x). For this case, both f(x) and A(x) are continuous on the infinite interval, 
f(x) has one zero, and A(x) has three zeroes corresponding to tv/o primary conjugate points and one zero 
at the origin. Figure 6 depicts these functions. Table 2 gives numerical values for the conjugate points of 
arctan(x), and the first few iterations for different guesses. 

TABLE 2. CONJUGATE POINTS AND ITERATIONS FOR 
f(x) = arctan(x) 


x p = - 1.39174520027, x p * = 1.39174520027 


Iteration 

x= 1.391746 

x = 1.391745 

1 

-1.391747 

-1.3917447 

2 

1.391751 

1.3917438 

3 

-1.391760 

-1.3917415 

4 

1.391784 

1.3917355 

5 

-1.391847 

-1.3917196 

6 

1.392015 

1.3916777 

7 

-1.392457 

-1.3915672 


The iterations given in Table 2 are a dramatic demonstration that Newton’s method converges inside 
primary conjugate points. Although primary conjugate points exist for many functions, some functions 
do not have conjugate points, and the only solutions to A(x) = 0 are those for which f(x) = 0. 

The next example presented is f(x) = x + c which is shown in Figure 7. For c = -1, f has zeroes 
at ±1. This is an example of function which is concave upward (f"(x) greater than 0) on the infinite 
interval. Functions which do not change curvature at least once cannot have primary conjugate points. 
For the case c = -1, the only discontinuity in Afx) occurs at f'(x) = 0, i.e., at x = 0. Because of the 
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concavity of the function and its position relative to the x axis, no Newton iterate xj can be equal to 

zero. The spikes in A(x) normally associated with f'(xj) = 0 do not occur for this case. The slope of 
A(x) at the two zeroes is -1 as before. 

For the case when c = 0, the form of A(x) changes drastically. The spike disappears and A(x) 
becomes a straight line through the origin with slope -3/4. This is the case where both f and f' go to 

zero at x z . Consider the general case of f = x n . In this situation, A(x) is always a straight line through 
the origin with the following slope. 


A'(x) = -(2 - 1 / n )/ n 


(5) 


Notice that if n = 1/2 corresponding to f(x) = x*^, A(x) = 0 for all x. In fact, if x is defined 

as (-x)^ for negative x, and x^ for positive x, it is found that every point is a conjugate point! 
This remarkable fact can be approached from another direction. Consider an odd function, i.e., f(x) - 
-f(~x). For every point to be a conjugate point, -x = x - f(x)/f'(x). Solving this little differential equation 

gives f = kx*/^. 

For the case, f(x) = x + 0.1, the character of A(x) again changes. When c was negative, A had 
one spike at the origin. When c was 0, A was a well behaved straight line. For c positive, A has three 
spikes. These three spikes correspond to the one place where f'(x) = 0, and the two places where f'(Xj) = 

0. Notice that there are two conjugate points on either side of the origin, but they are not primary con- 
jugate points because there is no zero in between. 

The next example, depicted in Figure 8, is f(x) = x 3 - x + c. For c = 0, this function has three 
zeroes at -1, 0, and +1. The function also has a point of inflection at the origin. A point of inflection 
is a necessary condition for the existence of primary conjugate points and the zero at x z = 0, has two 

primary conjugate points. Again, it is seen that the spikey nature of A(x) corresponds to f'(x), and 

f'(\j) = 0. The slope of f goes to zero at ±3 - ^ 3 . For c = 0, Newton’s method converges to the zeroes 
at ±1 for guesses outside the zeroes of f'(x). 

So far the fact that conjugate points can be overlapped has not been emphasized, i.e., x c j < x Q 2 
<x c l*<x c 2 *. In fact, the construction of a function with conjugate points ai any location, with any 

order of the conjugate points is possible. Geometrically, locations are chosen where conjugate points are 
desired and the necessary line is drawn in with the correct slope that intersects the x axis at the desired 
location of the corresponding conjugate point. Do this for each set of desired conjugate points, and then 
draw in a curve through the desired values at the conjugate points and tangent at these points to the 
previously drawn lines and the result is a function with the desired properties. Mathematically, one can 
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do the same thing with a polynomial function. For example, consider the odd function f = cjx + C 2 X + 
cjx^ + c 4 x 7 . This function should satisfy f(— 2) = 2, f(— 1 ) = 1, f(0) = 0, f(l) = -1, and f(2) = *-2. In 

addition, corresponding conjugate points are needed at -2 and 1, and at -1 and 2. The required values of 
Cj can be found and the result is given below. 


f(x) = -29x/18 + 67x 3 /72 - 13x 5 /36 + x 7 /24 


( 6 ) 




at - 


I 


4 ft •. k\l%. 


This function is plotted in Figure 9. One purpose of this exercise was to see if a function can have 
overlapping conjugate points without generating primary conjugate points inside the innermost con- 
structed conjugate points. For this example, a set of primary conjugate points were generated inside 
(-1,1). These occurred at x p = -0.71881058427 and Xp* = 0.71881058427. This result raises the ques- 
tion of whether it is possible to have a well behaved associated function without primary conjugate points 
but having overlapping nonprimary conjugate points. In this simple example, the hypothesized situation 
did not materialize. This point is still under investigation because, if possible, it limits the usefulness of 
the convergence theorem to be proved in the next section. 


Usually, the composite associated function is more complex than f(x). For f(x) = tan(x) + c, the 
opposite is true. The plot for this case is shown in Figure 10. For the tangent function, fixed points 
occur at 0, ±pi/2, ±pi, ... The fixed point at x = 0 is a legitimate zero of tar.(x), but the fixed point at 
pi/2 corresponds to tan(x) = °° Because f'(pi/2) = Ax goes to zero. For any initial guess, other than 
where the tangent is ±°°, Newton’s iteration converges to the nearest zero of the function. 


Figure 1 1 corresponds to fix) = arcsin(x). The associated function seems to have the desirable 
form with three zeroes, expected of primary conjugate points and a zero. This is not the case because 
the two outside solutions correspond to fixed points of Newton iteration and, hence, are not conjugate 
points. 

In this section, the wonderfully varied nature of the associated function A(x) was investigated. 
The examples make plausible the hypothesis that when primary conjugate points exist around the zero 
of a function, then Newton iteration converges for initial guesses between x p and x p *. Under certain 
conditions, this conjecture is proved in the next section. 


CONVERGENCE INSIDE PRIMARY CONJUGATE POINTS 


The proof of the conjecture that Newton’s method converges inside primary conjugate points 
will be outlined before the actual proof so that some understanding of the approach is possible. The 
hypotheses of the Theorem are chosen so that the “standard” form of A(x) is assured. This form is 
A(x p ) = A(x ? ) = A(x p *) = 0, and A(x) negative on (x p ,x z ), and positive on (x z ,x p *). Once this is 

achieved, then the only stumbling block is that the iteration once inside the conjectured convergence 
interval may wander back outside. For the “standard” form of A(x), this is shown to be impossible by 
contradiction. Then, because an initial guess on either side of the zero and in the interval will two 
iterations later be closer to the zero because of the positive and negative nature of A(x), then 
convergence is assured. The theorem follows. 


Assumptions 

1. fix) and fixj) are continuous, and have continuous first and second derivatives on an interval 

la,b]. fix) has a zero at x z and corresponding primary conjugate points at x p and x p * lying inside 
the interval [a, b] . 

2. f(x p )f"(x p ), f(x p *)f"(x p *), and f"(x p )f"(x p *) < 0. 

3. f'(x), and f'(xj) ^ 0 on [x p ,x p *]. 
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Figure 1 1 . Function and associated function for f(x) = arcsin(x). 


Conclusion 

Newton iteration converges to x, for initial guess x in (x n ,x *). 

L P P 

From the hypotheses, g(x) is continuous on (Xp,Xp*), and A(x) = 0 only at the two conjugate 
points and at the zero. At the zero, A'(x z ) = - 1 . Then A(x) is positive on (x p ,x z ), and negative on 
(x z ,Xp*). The proof follows quickly, if no point xj in I = ( x p» x p*) exists such that g(x-p) > Xp* or 
< Xp. Consider the first case. Since g(x-p) > Xp*, and since g is continuous, then there exists a point 
x fl on (xj,x 2 ) such that g(x a ) = x p *. Then A(x a ) = Xp - x a < 0 . This cannot be since it has already been 
shown that A is positive in the interval to the left of x z . In order to have x j exist, another conjugate 

point is required which violates the hypothesis of primary conjugate points. Now for x on I and less than 
x z , x < \2 < x 4 < ... < X2 n where x, ..., X2 n _2 are in Ij = (Xp,x z ), and x-> n is in I2 = (x z ,x p *). Then, 

because A is negative to the right of x z , X2 n > x 2 n +2 > > * 2 \ where the iterates from x^ n to x 2j_-» 

are all in I2 x 2j falls in jl. This process is continued indefinitely, each time falling closer and closer to 
x z , so that convergence to x z is clear. 

This . concludes the proof. The hypotheses are stringent and may be overly restrictive. From the 
examples of the previous section, many simple functions satisfy the hypotheses. 
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N POINT ITERATION CYCLES 


The form of the iteration function A(x) = Ax + Axj is suggestive. Do functions have three or 

more points that Newton iteration can oscillate between? The answer is yes, and the form of a 
generalized associated function whose zeroes give these n point cycles is given as follows. 

A n (x) = -f(x)/f'(x) - f(xj)/f'(x,) -... - (J0) 

With this notation, A(x) = A->(x). From the definition of A n (x), zeroes of fix) will also be zeroes of A n . 
Furthermore, zeroes of A 2 will be zeroes of A 4 , A^, .... and zeroes of A^ will be zeroes of A^,... and so 
on. An example for a three point cycle for fix) = cos(x) is given in Table 3. 

TABLE 3. A THREE POINT CYCLE FOR fix) = cos(x) 

xj = 0.36658570322905; 
x-j = 2.97 11 5658664654 
x 3 = -2.83922379060465 

SUMMARY 


A conjecture which gives improved convergence intervals for Newton’s method was proved under 
certain limiting conditions. The convergence intervals are defined by finding zeroes of an associated 
function, A(x). This function was given in general form for Newton’s method, and for specific functions 
fix), can be quite complex. For any given example where f and its derivative can be calculated, A(x) 
can be plotted. From these plots, a wealth of information can be gleaned. The use of computer graphics 
to explore properties of complicated functions is one form of a technique that the author refers to as 
“experimental mathematics.” 

The theorem developed in this paper is encumbered by some stringent assumptions which can in 
all probability be relaxed. The properties of the associated function are extremely nch and varied. The 
author has filled up one notebook in his explorations of A(x) and has certainly only scratched the 
surface. A(x) seems to be like unexplored territory, with many secrets yet to be uncovered. 
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